BIOTECHNOLOGY SYSTEMS BRANCH



RAW SEQUENCE LISTING 
ERROR REPORT 




The Biotechnology Systems Branch of the Scientific and Technical Information 
Center (STIC) detected errors when processing the following computer readable 
form: 




Application Serial Number:



THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS. 

PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 
APPLICANT, WITH A NOTICE TO COMPLY or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A 
NOTICE TO COMPLY 

FOR CRF SUBMISSION AND PATENTIN SOFTWARE QUESTIONS, PLEASE CONTACT 
MARK SPENCER, TELEPHONE: 703-308-4212; FAX: 703-308-4221 
Effective 12/13/03 : TELEPHONE: 571-272-2510; FAX: 571-273-0221 



TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER 
VERSION 4.1 PROGRAM , ACCESSIBLE THROUGH THE U.S. PATENT AND 
TRADEMARK OFFICE WEBSITE. SEE BELOW FOR ADDRESS: 

http://www.uspto.gov/web/offices/pac/checker/chkr41note.htm



Applicants submitting genetic sequence information electronically on diskette or CD-Rom should be aware that there is
a possibility that the disk/CD-Rom may have been affected by treatment given to all incoming mail.
Please consider using alternate methods of submission for the disk/CD-Rom or replacement disk/CD-Rom.
Any reply including a sequence listing in electronic form should NOT be sent to the 20231 zip code address for the
United States Patent and Trademark Office, and instead should be sent via the following to the indicated addresses:

1. EFS-Bio (<http://www.uspto.gov/ebc/efs/downloads/documents.htm>, EFS Submission User Manual - ePAVE)

2. U.S. Postal Service: Commissioner for Patents, P.O. Box 1450, Alexandria, VA 22313-1450 

3. Hand Carry directly to (EFFECTIVE 12/01/03): 

U.S. Patent and Trademark Office, Box Sequence, Customer Window, Lobby, Room 1B03, Crystal Plaza Two,
2011 South Clark Place, Arlington, VA 22202

4. Federal Express, United Parcel Service, or other delivery service to: U.S. Patent and Trademark Office,
Box Sequence, Room 4B03-Mailroom, Crystal Plaza Two, 2011 South Clark Place, Arlington, VA 22202



Revised I0/0S/03 



Raw Sequence Listing Error Summary 



ERROR DETECTED SUGGESTED CORRECTION SERIAL NUMBER: 

ATTN: NEW RULES CASES: PLEASE DISREGARD ENGLISH "ALPHA" HEADERS, WHICH WERE INSERTED BY PTO SOFTWARE



1  Wrapped Nucleics / Wrapped Aminos
   The number/text at the end of each line "wrapped" down to the next line. This may occur if your file
   was retrieved in a word processor after creating it. Please adjust your right margin to .3; this will
   prevent "wrapping."

2  Invalid Line Length
   The rules require that a line not exceed 72 characters in length. This includes white spaces.

3  Misaligned Amino Numbering
   The numbering under each 5th amino acid is misaligned. Do not use tab codes between numbers;
   use space characters, instead.

4  Non-ASCII
   The submitted file was not saved in ASCII (DOS) text, as required by the Sequence Rules. Please
   ensure your subsequent submission is saved in ASCII text.

5  Variable Length
   Sequence(s) ____ contain n's or Xaa's representing more than one residue. Per Sequence Rules,
   each n or Xaa can only represent a single residue. Please present the maximum number of each
   residue having variable length and indicate in the <220>-<223> section that some may be missing.

6  PatentIn 2.0 "bug"
   A "bug" in PatentIn version 2.0 has caused the <220>-<223> section to be missing from amino acid
   sequence(s) ____. Normally, PatentIn would automatically generate this section from the
   previously coded nucleic acid sequence. Please manually copy the relevant <220>-<223> section to
   the subsequent amino acid sequence. This applies to the mandatory <220>-<223> sections for
   Artificial or Unknown sequences.
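The line-level rules above (72-character limit, no tab codes, ASCII text only) lend themselves to a quick pre-submission check. The following is a minimal sketch only, not the USPTO Checker program: the function name `check_lines` and the wording of the messages are hypothetical, and the only rules applied are the three stated in the text.

```python
MAX_LINE_LENGTH = 72  # limit quoted in the error summary; white space counts


def check_lines(text):
    """Return (line_number, problem) pairs for the three line-level rules."""
    problems = []
    for number, line in enumerate(text.splitlines(), start=1):
        if len(line) > MAX_LINE_LENGTH:
            problems.append((number, "line exceeds 72 characters"))
        if "\t" in line:
            problems.append((number, "tab code used instead of spaces"))
        if not line.isascii():
            problems.append((number, "non-ASCII character"))
    return problems
```

Running such a check on the text file before copying it to disk would catch wrapped or over-long lines early; it says nothing about the content rules for <210>-<400> fields.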

7  Skipped Sequences (OLD RULES)
   Sequence(s) ____ missing. If intentional, please insert the following lines for each skipped sequence:

   (2) INFORMATION FOR SEQ ID NO:X: (insert SEQ ID NO where "X" is shown)
   (i) SEQUENCE CHARACTERISTICS: (Do not insert any subheadings under this heading)
   (xi) SEQUENCE DESCRIPTION: SEQ ID NO:X: (insert SEQ ID NO where "X" is shown)
   This sequence is intentionally skipped

   Please also adjust the "(ii) NUMBER OF SEQUENCES:" response to include the skipped sequences.

8  Skipped Sequences (NEW RULES)
   Sequence(s) ____ missing. If intentional, please insert the following lines for each skipped sequence:

   <210> sequence id number
   <400> sequence id number
   000
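The new-rules placeholder is three fixed lines per skipped sequence, so it can be generated mechanically. A minimal sketch, assuming SEQ ID numbers run from 1 to the <160> total; the helper names `skipped_sequence_block` and `fill_skipped` are hypothetical and are not part of PatentIn or Checker.

```python
def skipped_sequence_block(seq_id):
    """Build the three placeholder lines for an intentionally skipped sequence."""
    return "\n".join([f"<210> {seq_id}", f"<400> {seq_id}", "000"])


def fill_skipped(present_ids, total):
    """Return placeholder blocks for every SEQ ID in 1..total that is absent."""
    present = set(present_ids)
    return [skipped_sequence_block(i) for i in range(1, total + 1) if i not in present]
```

For example, a listing that declares four sequences but contains only SEQ ID NOs 1, 2 and 4 would need one placeholder block for SEQ ID NO 3.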



9  Use of n's or Xaa's (NEW RULES)
   Use of n's and/or Xaa's have been detected in the Sequence Listing.
   Per 1.823 of Sequence Rules, use of <220>-<223> is MANDATORY if n's or Xaa's are present.
   In <220> to <223> section, please explain location of n or Xaa, and which residue n or Xaa represents.

10 Invalid <213> Response
   Per 1.823 of Sequence Rules, the only valid <213> responses are: Unknown, Artificial Sequence, or
   scientific name (Genus/species). <220>-<223> section is required when <213> response is Unknown or
   is Artificial Sequence.

11 Use of <220>
   Sequence(s) ____ missing the <220> "Feature" and associated numeric identifiers and responses.
   Use of <220> to <223> is MANDATORY if <213> "Organism" response is "Artificial Sequence" or
   "Unknown." Please explain source of genetic material in <220> to <223> section.
   (See "Federal Register," 06/01/1998, Vol. 63, No. 104, pp. 29631-32) (Sec. 1.823 of Sequence Rules)

12 PatentIn 2.0 "bug"
   Please do not use "Copy to Disk" function of PatentIn version 2.0. This causes a corrupted file,
   resulting in missing mandatory numeric identifiers and responses (as indicated on raw sequence
   listing). Instead, please use "File Manager" or any other manual means to copy file to floppy disk.



13 Misuse of n/Xaa
   "n" can only represent a single nucleotide; "Xaa" can only represent a single amino acid.
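The conditions under which a <220>-<223> section becomes mandatory can be summarized as a single test per sequence. The sketch below is an illustration of the rules as worded in this summary, not an official validator: the function name `needs_feature_section` is hypothetical, and it assumes nucleotide sequences are given in one-letter code and protein sequences ("PRT") in space-separated three-letter code.

```python
def needs_feature_section(organism, seq_type, sequence):
    """True when the quoted rules make a <220>-<223> section mandatory."""
    # Rule on <213>: "Unknown" or "Artificial Sequence" always requires it.
    if organism.strip().lower() in ("unknown", "artificial sequence"):
        return True
    # Protein sequences: look for the three-letter token Xaa.
    if seq_type.upper() == "PRT":
        return "Xaa" in sequence.split()
    # Nucleotide sequences: any n requires an explanation.
    return "n" in sequence.lower()
```

The sequence type is passed in because a plain search for the letter n would otherwise misfire on three-letter residues such as Asn.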



AMC - Biotechnology Systems Branch - 09/09/2003 



RAW SEQUENCE LISTING                         DATE: 03/04/2004
PATENT APPLICATION: US/10/030,658A           TIME: 09:44:17

Input Set : A:\2004-02-25 4456-0101P.ST25.txt
Output Set: N:\CRF4\03042004\J030658A.raw

3 <110> APPLICANT: Yamamura Ken-ichi
4       Araki Kimi

6 <120> TITLE OF INVENTION: TRAP VECTORS AND GENE TRAPPING USING THE SAME

8 <130> FILE REFERENCE: 4456-0101P

10 <140> CURRENT APPLICATION NUMBER: 10/030,658A
11 <141> CURRENT FILING DATE: 2002-01-11

13 <150> PRIOR APPLICATION NUMBER: JP99/200997
14 <151> PRIOR FILING DATE: 1999-07-14

16 <160> NUMBER OF SEQ ID NOS: 17

18 <170> SOFTWARE: PatentIn Ver. 2.0

20 <210> SEQ ID NO: 1
21 <211> LENGTH: 13
22 <212> TYPE: DNA
23 <213> ORGANISM: Artificial Sequence

25 <220> FEATURE:
26 <223> OTHER INFORMATION: Description of Artificial Sequence: synthetic DNA

28 <400> SEQUENCE: 1
29 taccgttcgt ata 13

31 <210> SEQ ID NO: 2
32 <211> LENGTH: 13
33 <212> TYPE: DNA
34 <213> ORGANISM: Artificial Sequence

36 <220> FEATURE:
37 <223> OTHER INFORMATION: Description of Artificial Sequence: synthetic DNA

39 <400> SEQUENCE: 2
40 tatacgaacg gta 13

42 <210> SEQ ID NO: 3
43 <211> LENGTH: 34
44 <212> TYPE: DNA
45 <213> ORGANISM: Artificial Sequence

47 <220> FEATURE:
48 <223> OTHER INFORMATION: Description of Artificial Sequence: synthetic DNA

50 <400> SEQUENCE: 3
51 ataacttcgt atagcataca ttatacgaag ttat 34

53 <210> SEQ ID NO: 4
54 <211> LENGTH: 13
55 <212> TYPE: DNA
56 <213> ORGANISM: Artificial Sequence

58 <220> FEATURE:
59 <223> OTHER INFORMATION: Description of Artificial Sequence: synthetic DNA

61 <400> SEQUENCE: 4
62 ataacttcgt ata 13

64 <210> SEQ ID NO: 5




65 <211> LENGTH: 13
66 <212> TYPE: DNA
67 <213> ORGANISM: Artificial Sequence

69 <220> FEATURE:
70 <223> OTHER INFORMATION: Description of Artificial Sequence: synthetic DNA

72 <400> SEQUENCE: 5
73 tatacgaagt tat 13

75 <210> SEQ ID NO: 6
76 <211> LENGTH: 34
77 <212> TYPE: DNA
78 <213> ORGANISM: ( t he *

80 <220> FEATURE:
81 <223> OTHER INFORMATION: Homologous recombination sequence

83 <400> SEQUENCE: 6
84 taccgttcgt atagcataca ttatacgaac ggta 34

86 <210> SEQ ID NO: 7
87 <211> LENGTH: 19
88 <212> TYPE: DNA
89 <213> ORGANISM: Artificial Sequence

91 <220> FEATURE:
92 <223> OTHER INFORMATION: Z1 forward primer used in PCR for B-geo detection

94 <400> SEQUENCE: 7
95 gcgttaccca acttaatcg 19

97 <210> SEQ ID NO: 8
98 <211> LENGTH: 18
99 <212> TYPE: DNA
100 <213> ORGANISM: Artificial Sequence

102 <220> FEATURE:
103 <223> OTHER INFORMATION: Z2 reverse primer used in PCR for B-geo detection

105 <400> SEQUENCE: 8
106 tgtgagcgag taacaacc 18

108 <210> SEQ ID NO: 9
109 <211> LENGTH: 22
110 <212> TYPE: DNA
111 <213> ORGANISM: Artificial Sequence

113 <220> FEATURE:
114 <223> OTHER INFORMATION: Ori2 forward primer used in PCR for detecting the replication
115       origin region in pUC vector

117 <400> SEQUENCE: 9
118 gccagtggcg ataagtcgtg tc 22

120 <210> SEQ ID NO: 10
121 <211> LENGTH: 21
122 <212> TYPE: DNA
123 <213> ORGANISM: Artificial Sequence

125 <220> FEATURE:
126 <223> OTHER INFORMATION: Ori3 reverse primer used in PCR for detecting the replication
127       origin region in pUC vector

129 <400> SEQUENCE: 10
130 cacagaatca ggggataacg c 21

132 <210> SEQ ID NO: 11
133 <211> LENGTH: 400
134 <212> TYPE: DNA
135 <213> ORGANISM: Mus musculus

137 <220> FEATURE:
138 <221> NAME/KEY: misc_feature
139 <222> LOCATION: (36)..(36)
140 <223> OTHER INFORMATION: n is a, c, g, or t

142 <220> FEATURE:
143 <221> NAME/KEY: misc_feature
144 <222> LOCATION: (70)..(70)
145 <223> OTHER INFORMATION: n is a, c, g, or t

147 <220> FEATURE:
148 <221> NAME/KEY: misc_feature
149 <222> LOCATION: (362)..(362)
150 <223> OTHER INFORMATION: n is a, c, g, or t

152 <220> FEATURE:
153 <221> NAME/KEY: misc_feature
154 <222> LOCATION: (364)..(364)
155 <223> OTHER INFORMATION: n is a, c, g, or t

157 <220> FEATURE:
158 <221> NAME/KEY: misc_feature
159 <222> LOCATION: (377)..(377)
160 <223> OTHER INFORMATION: n is a, c, g, or t

163 <400> SEQUENCE: 11
W--> 164 agaaacttaa acagcggata aacttcagtg atttanatca gagaagtatt ggaagtgatt 60
165 ctcaaggtan agcaacagcg gctaacaaca aacgtcagct tagtgaaaac cgaaagccct 120
166 tcaacttttt gcctatgcag attaatacta acaagagcaa ggatgctact gcaagtcttc 180
167 caaagagaga gatgacaacg tcagcacagt gcaaagagtt gtttgcttct gctctaagta 240
168 atgacctttt gcaaaactgt caatctctga agaagatggg agaggggagc ctgcatggga 300
169 aacaccagat tgtaagcagg cttgttcaat cctgactata ttactaaagc tagttctatg 360
170 cnanaagttt tgtaaanaaa atgaaagtct gcaatgttga 400

172 <210> SEQ ID NO: 12
173 <211> LENGTH: 416
174 <212> TYPE: DNA
175 <213> ORGANISM: Mus musculus

177 <220> FEATURE:
178 <221> NAME/KEY: misc_feature
179 <222> LOCATION: (37)..(37)
180 <223> OTHER INFORMATION: n is a, c, g, or t

182 <220> FEATURE:
183 <221> NAME/KEY: misc_feature
184 <222> LOCATION: (363)..(363)
185 <223> OTHER INFORMATION: n is a, c, g, or t

187 <220> FEATURE:
188 <221> NAME/KEY: misc_feature
189 <222> LOCATION: (392)..(392)
190 <223> OTHER INFORMATION: n is a, c, g, or t

192 <220> FEATURE:

193 <221> NAME/KEY: misc_feature
194 <222> LOCATION: (401)..(401)
195 <223> OTHER INFORMATION: n is a, c, g, or t

197 <220> FEATURE:
198 <221> NAME/KEY: misc_feature
199 <222> LOCATION: (403)..(403)
200 <223> OTHER INFORMATION: n is a, c, g, or t

203 <400> SEQUENCE: 12
W--> 204 tcttctagct ttgcagcata aagcagagca agctatnagc tgtgatggat gactctgttg 60
205 ttacagaaac tacaggaagc ttatctggag tcagcstcac atctgaacta aatyaagaac 120
206 tgaatgattt aattcagcgt ttccataatc agcttcgtga ttctcagcct ccagctgttc 180
207 cagacaacag aagacaggca gaaagtcttt cattaactag agagatttct cagagcagaa 240
208 atccctcagt ttctgaacat ttacctgatg agaaagtaca gctttttagc aaaatgagag 300
209 tactacayga aaagaacaag aaatggacaa attagttggg agaacttcat aaccttcgag 360
210 atnagcatct gaacaactca tcatttgtgc cntcaacttc ncnccaaaga agtggg 416

212 <210> SEQ ID NO: 13
213 <211> LENGTH: 484
214 <212> TYPE: DNA
215 <213> ORGANISM: Mus musculus

217 <220> FEATURE:
218 <221> NAME/KEY: misc_feature
219 <222> LOCATION: (33)..(33)
220 <223> OTHER INFORMATION: n is a, c, g, or t

222 <220> FEATURE:
223 <221> NAME/KEY: misc_feature
224 <222> LOCATION: (48)..(48)
225 <223> OTHER INFORMATION: n is a, c, g, or t

227 <220> FEATURE:
228 <221> NAME/KEY: misc_feature
229 <222> LOCATION: (54)..(54)
230 <223> OTHER INFORMATION: n is a, c, g, or t

232 <220> FEATURE:
233 <221> NAME/KEY: misc_feature
234 <222> LOCATION: (89)..(89)
235 <223> OTHER INFORMATION: n is a, c, g, or t

237 <220> FEATURE:
238 <221> NAME/KEY: misc_feature
239 <222> LOCATION: (244)..(244)
240 <223> OTHER INFORMATION: n is a, c, g, or t

242 <220> FEATURE:
243 <221> NAME/KEY: misc_feature
244 <222> LOCATION: (257)..(257)
245 <223> OTHER INFORMATION: n is a, c, g, or t

248 <400> SEQUENCE: 13
W--> 249 gtttctacac ctactgaaca gcagcagcca ttnagctcaa aatccttnca gggnaaaaca 60
250 gagtatatgg cttttccaaa accctctgna aagcagttct tctcttggag cagaaaagca 120
251 aaggaatcaa gaaacagccc gaagaggaag ctgaaaacac taagacacca tggttatatg 180
252 atcaagaagg tggagtagaa aaaccatttt tcaagactgg atttacagag tctgtagaga 240
253 aagntacaaa atagtanccg caaaaatcaa ccagatacaa gcaggagaag acgtcggttt 300

254 gatgaagaat cccttggaaa gctttagrag tatgcctgat cctatagccc caacatcagt 360
255 aactaaaaca tttaaaacaa gaaaagcatc tycccaggcc agcctggcct ctaaggacaa 420
256 sactcccaaa tcaaagagta agaagaggat tctactcagc tgaaaagtag agttaaaaat 480
257 attg 484

260 <210> SEQ ID NO: 14
261 <211> LENGTH: 211
262 <212> TYPE: DNA
263 <213> ORGANISM: Mus musculus

265 <400> SEQUENCE: 14
266 ctgtctgtca ttgtcgttct cctttagaag gcagaaaaga aatgggaaga aaaaaygcaa 60
267 aatctggaac actataacgg aaaggagttc gagaagctcc tggaggaagc tcaggccaac 120
268 atcatgaagt caattccaaa cctggagatg cccccagctt ccagcccagt gtcaaaggga 180
269 gatgcggcag gggataagct ggagctgtca g 211

272 <210> SEQ ID NO: 15
273 <211> LENGTH: 34
274 <212> TYPE: DNA
275 <213> ORGANISM: Artificial Sequence

277 <220> FEATURE:
278 <223> OTHER INFORMATION: Description of Artificial Sequence: synthetic DNA

280 <400> SEQUENCE: 15
281 taccgttcgt atagcataca ttatacgaag ttat 34

283 <210> SEQ ID NO: 16
284 <211> LENGTH: 34
285 <212> TYPE: DNA
286 <213> ORGANISM: Artificial Sequence

288 <220> FEATURE:
289 <223> OTHER INFORMATION: Description of Artificial Sequence: synthetic DNA

291 <400> SEQUENCE: 16
292 ataacttcgt atagcataca ttatacgaac ggta 34

294 <210> SEQ ID NO: 17
295 <211> LENGTH: 34
296 <212> TYPE: DNA
297 <213> ORGANISM: Artificial Sequence

299 <220> FEATURE:
300 <223> OTHER INFORMATION: Description of Artificial Sequence: synthetic DNA

302 <400> SEQUENCE: 17
303 tattgaagca tatcytatgt aatatgcttc aata 34



file://C:\CRF4\Outhold\VsrJ030658A.htm 






RAW SEQUENCE LISTING ERROR SUMMARY           DATE: 03/04/2004
PATENT APPLICATION: US/10/030,658A           TIME: 09:44:18

Input Set : A:\2004-02-25 4456-0101P.ST25.txt
Output Set: N:\CRF4\03042004\J030658A.raw

Please Note: 

Use of n and/or Xaa have been detected in the Sequence Listing. Please review the 
Sequence Listing to ensure that a corresponding explanation is presented in the <220> 
to <223> fields of each sequence which presents at least one n or Xaa. 

Seq#:11; N Pos. 36,70,362,364,377
Seq#:12; N Pos. 37,363,392,401,403
Seq#:13; N Pos. 33,48,54,89,244,257 
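
The reported positions can be reproduced by counting residues and ignoring the spacing between blocks of ten. The sketch below is an illustration only: the function name `n_positions` is hypothetical, and the two rows of SEQ ID NO: 11 are transcribed as best read from the scanned listing, so they should be treated as an assumption rather than the filed sequence.

```python
def n_positions(sequence):
    """Return 1-based positions of every n, ignoring spaces and digits."""
    residues = "".join(ch for ch in sequence.lower() if ch.isalpha())
    return [index for index, residue in enumerate(residues, start=1) if residue == "n"]


# First two rows (residues 1-120) of SEQ ID NO: 11 as read from the scan.
first_rows = (
    "agaaacttaa acagcggata aacttcagtg atttanatca gagaagtatt ggaagtgatt "
    "ctcaaggtan agcaacagcg gctaacaaca aacgtcagct tagtgaaaac cgaaagccct"
)

declared = [36, 70]  # <222> LOCATION values falling within residues 1-120
found = n_positions(first_rows)
undeclared = sorted(set(found) - set(declared))
print(found, undeclared)  # [36, 70] []
```

An empty `undeclared` list means every n in those rows has a matching <222> LOCATION entry, which is the condition the note above asks applicants to confirm.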
VERIFICATION SUMMARY                         DATE: 03/04/2004
PATENT APPLICATION: US/10/030,658A           TIME: 09:44:18

Input Set : A:\2004-02-25 4456-0101P.ST25.txt
Output Set: N:\CRF4\03042004\J030658A.raw

L:164 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:11 after pos.:0
M:341 Repeated in SeqNo=11

L:204 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:12 after pos.:0
M:341 Repeated in SeqNo=12

L:249 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:13 after pos.:0
M:341 Repeated in SeqNo=13
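
Each flagged entry above follows the same pattern, so the entries can be pulled apart with a regular expression when several reports need to be compared. This is a sketch based only on the three lines shown: the field names (`listing_line`, `message`, `level`, `code`) are hypothetical labels, and reading `L:` as the raw listing line number, `M:` as a message number and `W` as a warning is an assumption, not documented behavior of the PTO software.

```python
import re

# Pattern inferred from the entries in this report; field meanings are assumed.
ENTRY = re.compile(
    r"L:(\d+)\s+M:(\d+)\s+([A-Z]):\s*\((\d+)\)\s+(.+?), for SEQ ID#:(\d+)"
)


def parse_entry(line):
    """Return the fields of one summary entry, or None if the line does not match."""
    match = ENTRY.search(line)
    if match is None:
        return None
    listing_line, message, level, code, text, seq_id = match.groups()
    return {
        "listing_line": int(listing_line),
        "message": int(message),
        "level": level,
        "code": int(code),
        "text": text,
        "seq_id": int(seq_id),
    }
```

The `listing_line` values (164, 204, 249) correspond to the rows marked `W-->` in the raw listing, which is where the first n of each flagged sequence appears.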